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Abstract. We give a new proof of Rudolph's multiple term return times theorem based on 
Host-Kra structure theory. Our approach also yields a multiple term Wiener-Wintner-type 
return times theorem for nilsequences. 



1. Introduction 

In this article we are concerned with universally good weights for pointwise convergence 
of ergodic averages, i.e., sequences (a„) such that, for every measure-preserving system 
(7,S) and every g e L°°(Y), the averages 



1 



N N , 

n=l 

converge for a.e. y e 7. Bourgain's return times theorem rBF K089(] asserts that, given any 
ergodic measure-preserving system {X, T), for every / e L°°(X) and a.e. x gX the sequence 
of weights a„ = f{T^x) is universally good for pointwise convergence. The name "return 
times theorem" comes from the case of a characteristic function f — 1^, A ci X. Then 
the theorem can be equivalently formulated by saying that, for a.e. x &X, the pointwise 
ergodic theorem on any system Y holds along the sequence of return times of x to A. 

This has been extended to averages involving multiple terms by Rudolph ||Rud98 l|. In 
order to formulate his result and for future convenience we now introduce some notation. 
By a system we mean an ergodic regular measure-preserving system {X,ix, T) with a distin- 
guished countable subset D c L°°(X) that is sufficiently large (we will formulate the precise 
condition on D in Defintion l2.8D . 

Definition 1.1. Let P be a statement about ergodic regular measure-preserving systems 
(Xj, jUj, Tj), functions /; e L°°(X;) and points X; &X^, i = 0,...,k. We say that P holds for 
universally almost every (u.a.e.) tuple Xq,...,x^ if 

(0) For every system (Xq, jUq, Tq, Dq) there exists a set of full measure Xq c Xq such that 

(1) for every system (X^, jji^, T^, D^) there exists a measurable setX^ c Xq xX^ such that 
for every Xq g Xq the set {x^ : (xg, x^) G X^} has full measure in X-^ and 

(fc) for every system {X^,ijl^, T^,DjJ there exists a measurable set Xj. (Z Xq x ■ ■ ■ x X^ 
such that for every x^_i e X]^_i the set {x;^ : (Xj^_i, X;^) ^X^} has full measure in Xj^ 
and 

we have P(Jq, ...,/;,, Xj^) for every Xj^ e Xj^ and every G D^, i — 0, . . . ,k. 
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With this convention Rudolph's multiple term return times theorem says that for every 
fc e N the averages 

1 ^ 

a.2) -2/o(ro%)---A(r,"x,) 

n=l 

converge for u.a.e. Xq, ...,Xj^. We call this statement RTT(k). Birkhoff's pointwise ergodic 
theorem llBir31ll is essentially RTT(O) and Bourgain's return times theorem is RTT(l). More 
about the history of these and related results can be found in a recent survey by Assani and 
Presser IIAP12all . 

It is known that the Host-Kra-Ziegler nilfactor ^^CXq) is characteristic for RTT(k) in the 
sense that if /q -L 2^j^{Xq), then the averages (I1.2D converge to zero u.a.e. [ AP12b[ Theorem 
4]. However, the proof of this fact hitherto depends on the convergence result RTT(k). 

In this article we prove both results, RTT(k) and characteristicity simultaneously by 
induction on k using the Host-Kra structure theory. We also obtain the following Wiener- 
Wintner return times theorem for nilsequences thereby generalizing ||ALR95[ Theorem 1]. 

Theorem 1.3 (Wiener- Wintner return times theorem for nilsequences). Let k,l G N and 

/; e L°°iXil i = 0,...,k. Then for u.cL.e. Xq, . . . , Xjt- and every l-step nilsequence (Qn)^ the 
averages 



converge (to zero if in addition /q -L 3ffc+;(Xo) or /; _L 2^j^_^.i+i_i{Xi) for some i — 1, . . . ,k). 

The general strategy is to consider not only the X^'s but also the Host-Kra cube spaces of 
all orders simultaneously and to use the convergence criterion due to Bourgain, Fursten- 
berg, Katznelson and Ornstein rProposition l2.11D to pass from k to k+1. After a preparatory 
Section [2] we formulate our central convergence Theorem 13.21 on cube spaces, generalizing 
RTT(k) and giving some information about characteristic factors. The remaining part of 
Section |3] is devoted to the proof of Theorem l3.2[ Theorem 1 1.3 1 is then proved in Section |4l 

2. Notation and tools 

Ergodic decomposition. For the purposes of this article we find it illuminating to think of 
the ergodic decomposition in a particular way (that will be generalized in Section [3]). Let 
(X,jLt, T) be a regular ergodic measure-preserving system, i.e. X is a compact metric space, 
T :X is an invertible continuous map and /i is a T -invariant ergodic Borel probability 
measure. By the pointwise ergodic theorem a.e. x gX is generic for some T-invariant Borel 
probability measure m^ onX, i.e. Sn=i/(^-^"-^) ~* //dm^ for every / e C(X). It follows 
easily that the function x >— > m^ is measurable and 

r 



(2.1) ^Ji 



m^d;a(x) 



In particular, for ji-a.e. x the measure m.^ is defined for m^-a.e. y. To see that m^ is ergodic 
for jJL-a.e. x it suffices to verify that 



(2.2) 



/dm - 



/dm. 



2 



dm^(y)d/i(x) = for every / e C(X), 



since this says precisely that the ergodic averages of / converge pointwise m^-a.e. to an 
m^ -essentially constant function for pt-a.e. x, and the latter full measure set can be chosen 
independently from / since C{X) is separable. By definition of m^,m^, the dominated 
convergence theorem and (12. ID we can rewrite the integral in (12. 2D as 



21im 

N 



=1 J n=l J n=l 

N f I ^ M 

i77T^T"fnx)d^Ji{x) - 21imlim (-^ rYX^X-^ T'"/)(x)dKx), 



n=l 

r 
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= 21im 

JV 



N M N 

n=l n=l n=l 

and this vanishes by the pointwise ergodic theorem and the dominated convergence theo- 
rem. 

Host-Kra structure theory. We recall the basic definitions and main results surrounding 
the uniformity seminorms | |HK05| | . Let (X,jU, T) be a regular ergodic measure-preserving 
system. The cube measures iJi^^'^ onX^'^^ : — X^ are defined inductively starting with iJi^^^ :— 
yL. In the inductive step, given yi^^^, fix an ergodic decomposition 

r 



m^djLt'^'^(x) 



as in (12. ID . The space on which m^ is defined can be inferred from the subscript x. Define 

r 



(2.3) 



5^®m.^dLijP^^{x). 



To see that this coincides with the conventional definition one can use (I2.1D and (12. 2D to 
write the above integral as 



(2.4) M^'+'^ = 



5y (8)mydm^(y)d;if'^(x) 



5 (8)mj,dmj,(y)d/x'-'^(x) 



m^ (8) mj,dpt'-'^(x). 



The uniformity seminorms are 

r 

(2.5) "^"2'- 



i+i 
iji+i 



where ^^^^ is the T^^'^ -invariant sub-cr-algebra on X^^^. For /"^ e L°°(X), e e {0,1}', we 
will abbreviate f^^^ := <8)ee{o,i}'/'^- The uniformity seminorms satisfy the Cauchy-Schwarz- 
Gowers inequality | |HK05l Lemma 3.9.(1)] 

r 



(2.6) 



eG{0,l}' 



The [/'"'"^-seminorms determine factors 2f;(X) by the relation 



flL\2fiiX)) 



ym =Ofor/ eL°°(X). 



The factors 3f,(X) are inverse limits of rotations on Z-step nilmanifolds (in fact, topological 
inverse limits [HKMIO', Theorem A.l]). 

We also need the classical fact that the Kronecker factor is characteristic for conver- 
gence of ergodic averages with arbitrary bounded scalar weights, see e.g. IIHK091 Corollary 
7.3] for a more general version. 

Lemma 2.7. Let (X, T) be an ergodic measure-preserving system andf e L^{X) be orthogonal 
to ^liX). Then for any bounded sequence (a„)„ one has 



1 ^ 

lim — Va„T"/=0 in L^(X). 
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Conventions about cube measures. In the sequel we will have to consider systems for 
which a certain approximating procedure can be carried out within their distinguished sets. 

Definition 2.8. A system is a regular ergodic measure-preserving system iX,iJL, T) with a 
distinguished set D c L°°(X) that satisfies the following conditions. 

(1) (Cardinahty) D is countable. 

(2) (Density) D contains an L°°-dense subset of C(X). 

(3) (Algebra) D is closed under pointwise multiplication and absolute value. 

(4) (Decomposition) For every f & D and I e N there exist decompositions 

Dec(0 f^fl+Uj+ferrJ, J ^ N, 

such that f±,f^j,ferr,j ^ 1. 3f;(X), f^j G C(Zj), where Zj is a nilfactor of 

ll/errjlli-c/x) IS Uniformly bouuded in j and WferrjWiKix) ^ as j ^ oo. 

Clearly, for any regular ergodic measure-preserving system (X, jji, T) any countable subset 
of L°°(X) is contained in a set D that satisfies the above conditions. 

Lemma 2.9. Let {X,jji, T,D) be a system. Then there exist measurable subsets 7; c X^'-^ such 
that for every I e N the following statements hold. 

(1) jLt'^'^(Y;) = 1 and for every y e 7; we have my(Yi) — 1. 

(2) For every y e Y; the measure my is ergodic and one has 



(2.10) ^y®^y 



(3) 7; c (X)^^\ where X <zX is the set of points that are generic for each f G D w.r.t. ji. 

(4) For every y e Yi, every fc e N and any functions f^ G D, e G {0,1}'^ such that 
/, 1 2^^+iiX) for some e we have f^^^ 1 2^^(X^^\my). 

Proof The fact that the last condition holds for full measure sets of y, follows from the 
Cauchy-Schwarz-Gowers inequality (12.61) . The remaining conditions can be achieved using 
(I2.4D and the pointwise ergodic theorem. □ 

Bourgain-Furstenberg-Katznelson-Ornstein criterion. Our main tool for proving conver- 
gence u.a.e. is the following criterion that reduces the search for a universal set of x G X 
(that a priori involves uncountably many systems 7) to a problem about X^. 

Proposition 2.11 ( IIBFK0891 Proposition]). Let (X, T) be an ergodic measure-preserving sys- 
tem and f G L°°(X) n 2^^{X)-^. Assume that x gX is generic for f and 



1 ^ 

-2/(r"x)/(r"?)-0 fora.e.^GX. 

Then for every measure-preserving system (7, S) and g G L°°(7) we have 

1 

-2/(r"x)g(S"y)-0 fora.e.yGY. 

This has been generalized to averages along F0lner sequences in countable amenable 
groups satisfying the Tempelman condition IIOW921 §3]. It seems plausible that a corre- 
sponding statement for tempered F0lner sequences (for which the pointwise ergodic theo- 
rem is known to hold by IlLinOliD would also be true, but we concentrate on the standard 
Cesaro averages. 
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A measure-theoretic lemma. The next lemma is our main tool for dealing with cube mea- 
sures. Informally, it shows that a certain kind of universality for jJL^^^ (8) v'^^^ implies some 
universality for (^i x v)'-^-'. 

Recall that, for ergodic measure-preserving systems {X, jji), (Y, v), the projection onto the 
invariant factor of X x Y has the form 4>{x, y) — xp{7ii{x), 7ii(y)), where tt^ are projections 
onto the Kronecker factors and xp is the quotient map of x 2fi(Y) by the orbit closure 

of the identity. To see this, recall that by Lemma 12.71 the function f ® g, f s L°°{X), 
g e L°°(y), is orthogonal to the invariant factor of X x 7 whenever / ± ^liX) or g ± 
S'i(Y). Thus the invariant sub-cj-algebra onX x 7 is contained in x i.e. it is 

(isomorphic to) the invariant sub-cj-algebra of a product of two compact group rotations 
(cf. e.g. [|Rud95i Theorem 1.9]). In particular, for an ergodic system Y the invariant factor 
of 7 X 7 is isomorphic to 5fi(7). 

Lemma 2.12. Let (X, ji), (7, v) be ergodic measure-preserving systems and fix measure disin- 
tegrations 



jJL = M^djc, V = 

This induces an ergodic decomposition 

r 



V (g) V 



(v(8)v);^dA, (v(8)v);i 



V 



v,/,-idA'. 



Let X gX and A c 2f^(7) be a full measure set. Assume that for jji-a.e. E, and every A e A, for 
(v ® v);)^-a.e. (rj, rj') one some statement E,, rj, rj') holds. Then P(x, E,, y, 17) also holds for 
v-a.e. y and m^^-a.e. (<^, 17), where 

c 



m 



\i^®v^A.{k, A), 

Ke2fi(X),Aea"i(y):^/)(7ri(x),7ri(y))=V'(ic,A) 

the homomorphism xp is as above and the integral is taken over an ajfine subgroup with respect 
to its Haar measure. 

Proof Recall that kerip has full projections on both coordinates. Therefore, for every x there 
is a full measure set of E, such that the set A has full measure in {A : 'ip(ni{x)ni{E,)~^ , A) = 
id} (note that this is a closed affine subgroup of ^i(7) that therefore has a Haar measure). 

In particular, for a full measure set of E, (that depends on 7) the hypothesis holds for 
a.e. A with 'ip(n^{x)Tii(^)~^ , X) — id, i.e. we have Fix, •) for a set of full measure w.r.t. the 
measure 



(v(8)v);^dAd/i(0 



Aeiri(y):V'(7ri(x)7ri(?)-M)=id 

r 



/x^ (8) (v ® v);;LdAdK- 



r f 



V 



A' Vy^-idX'dXdK 



5y ® Vn(y)X-^ 



dv(y)dAdK- 



Ke2fi(Z) J Ae2fi(y):i/i(7ri(x)ic-i,A)=id 



^^®5y® v^^y^^-idMKdviy) 
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y 

jey J h-eiri(X),AeS-i(y):^(7ri(x),7ri(y))=V'(K,A) 

r 



JyeY 

This gives P(x, E,, y, 17) for v-a.e. y and riij^^-a.e. pair (<^, 17) as required. □ 

The next lemma provides us with means for using the measure m^^ in a higher step 
setting. 

Lemma 2.13. Let {Z,g),iZ',g') be ergodic nilrotations and xp : ^i(Z) x ^^{Z') H the 
factor map modulo the orbit closure of (7ii(g), ^lig'))- Then for every A e 2^i(Z) and a.e. 
"X! e 3fi(Z') the rotation by {g,g') on the nilmanifold 

is uniquely ergodic. 

Proof By [|Lei05l 2.17-2.20] it suffices to prove ergodicity to obtain unique ergodicity. 

Since Np^;^^! only depends on i/j(A, A') and kevxp has full projection on 3fi(Z) it suffices 
to verify the conclusion for a full measure set of (A, A')- For this end it suffices to check 
that for any / e C(Z),/' e C(Z') the limit of the ergodic averages oi f <S> f is essentially 
constant on N;^j>^^. We decompose / ^ f^+f^e with 1 3fi(Z) and f^ e L°°(3fi(Z)), and 
analogously for For f^ (8)/^ the limit is essentially constant on N^ ;^i for any (A, A') since 
the rotation is ergodic on (ti^ x 7ii)(iV;^ a')- 

On the other hand, the limit of the ergodic averages of tensor products involving fj_ 
vanishes on Z x Z' a.e. by Lemma [2?7l hence also a.e. on a.e. fiber Af;^^ ;^'. □ 

3. Return times theorem on cube spaces 

In order to concisely state our central result, Theorem 13.21 we need a cube version of 
Definition [TTTl Recall that we write //'^ = ^^gjcil'/i.e. where /; ^ e L°°(X;). 

Definition 3.1. Let P be a statement about ergodic regular measure-preserving systems 
(X;,//;, Tj), functions and points X; e xl^\ i — 0,...,k. We say that P holds for [l]- 
universally almost every ([l]-u.a.e.) tuple Xq,...,x^. if 

(0) For every system (Xq, ^Xq, Tq, Dq) there exists a measurable set X^'^ c X^'^ such that 
for every Jq ^ Yq i we have m^^(Xg'^) = 1 and 

(1) for every system {X^,jJL^, T^,Di) there exists a measurable setxj'-' c X^^ xx[^^ such 
that for every Xq e X^^ and every y^ e ; we have m.^,^ {x^ : {xq, Xi) gX^} — 1 and 

(k) for every system (X^, yi^, Tj., D^) there exists a measurable set X^'-' c X^^ x • • • x X^'-' 
such that for every x^_'^ e X^'i^ and every jj. e 7;^ , we have my^{xy. : (x;^.^,^;^) e 
Ifc} = 1 and 

we have P^f^^, ■ ■ .,fl^^,x{) for every Xy. gX^. and any /j^ e D;, < i < fc, e e {0, 1}'. 
With this definition "u.a.e." corresponds to "[0]-u.a.e.". 

Our main theorem below states that certain nilfactors are characteristic for return time 
averages on cube spaces. 

Theorem 3.2. For any k,l G N the ergodic averages of (8)|^^g//'"' converge [l]-u.a.e. If in 
addition 

CF(/c, 36 G {0, 1}' s.t. 4, 1 ^fc+,(Xo) or f,^, 1 ^^^,+^_,{X,) for some l<i<k 
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then the Umit vanishes [l]-u.a.e. 

The case Z = of Theorem 13.21 is RTT(k) with additional information ICFT fc. 0) about 
characteristic factors. 

The proof is by induction on k and the inductive loop of the proof spans this whole 
section. The base case k — follows by definition of Yqj and the pointwise ergodic theorem. 

Thus we assume Theorem 13.21 for some fixed k G N and prove all intermediate results 
for this value of k assuming that they hold for lower values of k. The base cases of the 
intermediate results will be proved as we state them. 

In order to pass from to + 1 in Theorem 13. 21 we write 

(3.3) x...xx;'+^] = (xi'] x...x4'])2=:x2. 

Our aim is to apply Proposition 12.111 with this X and Y — X^^^. The remaining part of this 
section is devoted to verification of the hypotheses of that proposition. This involves the 
following steps. First we use RTT(k) to construct a certain universal measure disintegration 
with built-in genericity properties on a product of ergodic systems (Theorem 13. 4D . We use 
characteristic factors for RTT(k) to represent measures in this disintegration in a different 
way. Finally, we verify a certain instance of RTT(k-l-l) (Lemma [3.12D . 

Universal disintegration of product measures. The return times theorem can be seen 
as a statement about measure disintegration, cf. [|ALR95[ Theorem 4] for the special case 
fc = 1. 

Theorem 3.4. Let (X^, jji^, T^,D^), i — 0, . . . , k, be systems. Then [l]-u.a.e. Xg, . . . , Xj, is generic 
for some measure m^^ on X^^ x • • • x X^'-' and every function <8)|Lo//'^^ fi,£ ^ ^i- 
Moreover, for [l]-u.a.e. Xq,..., Xj^_i and every y^. e Yi^. one has 

Proof By Theorem 13. 21 with I = we obtain convergence of the averages 



1 



for [/]-u.a.e. Xq,...,x^. and any /^^ e D^. For continuous functions f ,, e D; we define 
n^xo,...,xt('^iLo/i '"'"') limit of these averages. By the Stone-Weierstraft theorem these 

tensor products span a dense subspace C{Xq^ x- • -xX^^^), so by density the above (bounded) 
linear form admits a unique continuous extension. 

In order to obtain (13. 5D it suffices to verify that the integrals of functions of the form 
®iLo//'^' fi,e ^ ^i' with respect to both measures coincide. By genericity and the dominated 
convergence theorem we have for [Z]-u.a.e. Xg, . . . , Xj,_^ that 

2';<j/'^®//'^clm,„,.,.,,^dm3,^(x,) 



n=l i<k 



limiVn//'^(7^.%) 



n=l Kk 



//'](r,"x,)dm^^(xj 



r 



8 



PAVEL ZORIN-KRANICH 



as required. □ 

Properties of the universal disintegration. We will now represent the measure m^^ 
for [Z]-u.a.e. Xq,...,x^. in the form m^ -^ in the notation of Lemma [2.121 At this step we 
have to use the information about characterisitc factors. We begin with a preliminary ob- 
servation. 

Lemma 3.6. If some property P holds [l]-u.a.e. then, for [l]-u.a.e. x^, . . .,x,^, P holds m^^^ 
a.e. 

Proof For k — this follows from (13.5ft . Assume that the conclusion is known for k — 1 and 
show it for k. 

By the induction hypothesis, for [Z]-u.a.e. Xq,..., x^^.j, m^^ ^^_^-a.e., for every jj^ e Y^j, 
P holds m^^-a.e. in x^. The conclusion follows from (13.5ft . □ 

Lemma 3.7. For [l]-u.a.e. Xq,...,x^. we have 



where we use the notation ofLemma \2.12\ with {X, ju) = (Xq^ x • • • xx['^^, t^xo,...,x^-i)' 0^' — 

m^,X X = (xo, . . . , Xfc_i) and y = x^. 
Proof To verify that the measures coincide it suffices to check that the integrals of functions 
of the form (2>|Lo//'^ fi,e ^ coincide. For this end consider the splittings = /;^^ + 

fi,e,T,j+fi,e,err,j, j ^ glveU bvlDecffc + l + l-i). 

Projections of tensor products that involve ^ ^ on one of the Kronecker factors vanish 
a.e. for [Z]-u.a.e. x,y by Corollary 13.131 for k — 1 that is part of the induction hypothesis 
for this section. Since kerxp has full projections on both coordinates the corresponding 
integrals w.r.t. m^^ also vanish. The integrals wrt. m^^ vanish for [Z]-u.a.e. x,y by 
Theorem 13. 21 

For the main terms we have 

(3.8) J 

^(®f=oVSjl^i(^))('^M//i;l^i(^))(W'c,A) 

Ke3!"i(X),Ae2-i(y):V(?ri(x),7ri(j))=V'(h-,A) 

Since the underlying nilmanifold of a nilrotation is a bundle of nilmanifolds over its Kro- 
necker factor, the conditional expectation above is just integration in the fibers, and by 
uniqueness of the Haar measure the whole integral equals 

®froV.S/'c)//'i/A)d(K,A), 

where Z, is the orbit closure of x in Ff*^ 1 Z^^^ and Z'. is the orbit closure of y in zl^\ By 
Lemma [2. 131 the above fibers of Zj x Zj are uniquely ergodic for every x and a.e. y, and 
the integral then equals 

N r 
n=l J 

It remains to treat the error terms, i.e. the case /;/^/ = e'.errj for some i', e'. By Lemma [2T9] (l3]). 
for [Z]-u.a.e. x,y we have 

r 



^x,y 
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Similarly, we have / | ^'iJ fP^\dm, < \\fi' Jl^^i^^^ if i' < k and / l/^^'^m^^ < \\hJL\^,,) 

a i' = k for [Z]-u.a.e. x,y. This implies that either E((8)fro//'^l^i(^)) or 
converges to zero in probability for [/]-u.a.e. x,y, so 

j <^ljP^dm,_^^0 as j-oo 

for [Z]-u.a.e. Xg, . . . , Xj^. since ker -0 has full projections on coordinates. □ 

Corollary 3.9. For [l]-u.a.e. Xg, . . . , Xj, the measure m^^ = m^^ is ergodic. 

Note that even for a non-ergodic invariant measure on a regular system there may exist 
generic points, so the mere fact that x is generic for does not suffice. 

Proof. In order to see that m^^ is ergodic it suffices to verify that for any continuous 
functions /^^ G C{XJ we have 

1 ^ r 

^3-10) ^TNT.<of'''(T^^^- ®-=o//'dm,o...x. for m,„,„„,^-a.e. 

n=l J 

Recall that for [Z]-u.a.e. Xg, . . . , Xj, the limit on the left-hand side of (I3.10D exists for m^^ j^^- 

a.e. ^ by Lemma [3^ and equals j (8)|Lo//'^dm|. Splitting the /;^'s as before it suffices to 
verify (I3.10D for the main terms, and this follows directly from (13. 8D . □ 

The sufficient special case of convergence to zero. The last hypothesis of Proposition l2 . 1 11 
is a certain special case of its conclusion. Recall that we already have u.a.e. convergence to 
zero on (as defined in (13.31) ). but not yet in the required sense. This is now corrected 
using Lemma [2.12l 

Lemma 3.11 (Change of order in the cube construction). Let I e N and P be a statement 
about points o/]~[^^gX['^^-'. Assume that for [I + l]-u.a.e. Xg, ...,Xj^we have P(xo, . . .,x^). 
Then for [l]-u.a.e. XQ,...,Xj^,form^^^^-a.e. x', we have P{xq, . . . ,Xj^,x'). 

Strictly speaking, the coordinates of x' in (xg, . . . , X;^, x') should be attached to Xg, . . . , x^,. 
but we do not want to introduce additional notation at this point. 

Proof The base case = follows directly from (I2.10D . 

Assume now that fc > 0. By the inductive hypothesis of this section the conclusion holds 
for fc — 1, so for [/]-u.a.e. Xg, . . . , Xj,_i,/or m^^ ^^_^-a.e. x',for every y^. e ;+i and my^-a.e. 
x^, we h ave P( xg, . . . , x^.j, x', x^.). 

Using fl2.10D we can rewrite the emphasized part of the statement as "for m^^ ^^_^-a.e. 
x' , for every y^^ e Yy. i, for every ergodic component jUg of (rHy^)^ from a fixed full measure 
set, for jUg-a.e. x^^" The conclusion follows by Lemma 12.121 and Lemma [3T7l □ 

Lemma 3.12. Let 1,1' G'N and assume \CF\ k. I + V). Then for [l]-u.a.e. Xg = (xg, . ..,Xj^), for 
m^^-a.e. x^, . . . , for m^j^ -a.e. X;/ the ergodic averages of the function <2>jLg//'^' "' converge 
to zero at (xg, . . . , X;/). 

Again, the tensor product ®*Lo//'^' ^ should be arranged in a different order, but in our 
opinion the above notation makes our goal more clear: it is not the function but the order 
in which we build the product space that changes. 

Proof We use induction on V. The case = is precisely Theorem 13.21 Assume that the 
conclusion is known for I + 1 and V — 1. The claim for I and V follows by Lemma l3.11[ □ 

Corollary 3.13. Let l.V Gf^ and assume \CF[ k. I + V\ Then for [l]-u.a.e. Xg, . . . , x^^ we /lave 
/J']®---®/,['U2f,(m,^_,J. 
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Proof. This follows from Lemma l3.12l by Lemma |3^ the definition of cube measures (12.31) . 
the characterization of uniformity seminorms (I2.5D and the ergodic theorem. □ 

Proof of Theorem \3.2\ f or k + 1. Assume first ICFf fc. Z). Then Lemma 13.1 21 with V — 1 states 
that for [Z]-u.a.e. x — (xg, . . . , x^), for m^^ ^^-a.e. x' , for any Z,^ e D; we have 

n=l 

For [Z]-u.a.e. XQ,...,Xf^ we obtain genericity wrt. m^^ by Theorem 13.41 ergodicity of 
n^xo,...,Xfc by Corollarv l3.9l and orthogonality of iS'lLo//'"' to the Kronecker factor of m^^ by 
Corollary[3ll3j so Proposition[2lII]withX = (X^'^ x • • • xX^'^, m^^ and Y = {X^^l^, m^^^J 
implies the claimed convergence to zero [Z]-u.a.e. 

This takes care of the terms /i_e,_L ii^ the splittings /; ^ = /; e,_L + fi,e,3^j + fi,e,err,j given 
by IDecf fc + Z + 1 — i) (resp. + I for i = 0). By an approximation argument like in 
the proof of Lemma 13.71 it suffices to consider the main terms, so we may assume that 

Y\'i=ofi'^^(.(.'^i^^T^^ ^ nilsequence. The claimed convergence a.e. in Xj^^^ then follows 
from the Wiener-Wintner theorem for nilsequences [HK09, Theorem 2.22]. The limit is 
zero a.e. in Xj,+^ provided that Z^+i^e -L ^i+i(^k+i) foi' some e by definition of Y^+i ; and 
Lemma [221 □ 

4. Wiener-Wintner return times theorem for nilsequences 

The first step in the proof is the identification of characteristic factors in the spirit of 
IIALR951 §4]. We refer to IIEZK12II for the notation. 

Lemma 4.1. Let f e L°°(X;X i = 0,...,k, and assume \CF[ k. I). Let further G/F be a nilman- 
ifold, G, a filtration on G of length I, and let aMaVcev basis adapted to G, be fixed. Then for 
u.a.e. Xq,...,x^ we have 



(4.2) lim sup , \-YFigin)r)Y~\fiT"xJ 

gGP(Z,G.),feW''-2'(G/r) n=l !=0 

where k = S!=i(^r - dr+i){/_i)- 



= 0, 



Proof By Corollary \3A3\ we have ®|Lo/i 1 2^iiT^x) for u-a.e. x G Xq x ■ ■ ■ x and by 
Theorem 13.41 u.a.e. x is fully generic for (8);/; w.r.t. m^^. The claim follows by the uniform 
Wiener-Wintner theorem riEZK12L Theorem 4.1]. □ 

Theorem II. 31 now follows from equidistribution results on nilmanifolds. 

Proof of Theorem [773] f or the given k. By Lemma |4.1| it suffices to consider/; e L°°(S';+|i,+i_;(X;)). 
By the pointwise ergodic theorem we can assume that each /; is a smooth function on a 
nilsystem factor of . The conclusion now follows from equidistribution results on nilman- 
ifolds IILeiOSl Theorem B] . □ 
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